i • 



'1 



UNITED STATES PATENT APPLICATION 



for 



A MOTION DETECTING WEB CAMERA SYSTEM 



Inventor: 
Bradford H. Needham 



Prepared by: 

BLAKELY, SOKOLOFF, TAYLOR & ZAFMAN, LLP 

12400 Wilshire Boulevard 
Los Angeles, CA 90025-1026 
(408) 720-8598 



Attorney Docket No: 42390. P7389 



A Motion Detecting Web Camera System 



Related Application 

This application is related to Application No. 
Microphone System", filed concurrently herewjtjvaTta^Application No. 09/124,179 
entitled, "Digital Opaque ProjecjojJerEasy Creation and Capture of Presentation 
Material", filed JuJyJ28TT998, which applications are assigned to the assignee of 
thej^rjseeTifapplication . 

Field Of The Invention 

The present invention relates generally to the field of video capture; more 
particularly, to video camera systems having information processing capabilities 
for uploading pictures to a web server. 

Background of the Invention 

A web camera (i.e., "webcam") system consists of a video camera plus 
software that runs on a personal computer to periodically upload an image from 
the camera to a web page. The basic purpose of a web camera system is to 
post a reasonably live picture on a user-specified web page. Many webcam 
systems upload images on a periodic basis; for example, uploading an image 
once per hour. 

Figure 1 illustrates a web camera system 10, which is typical of the prior 
art. System 10 includes a video camera 1 1 that outputs a captured video image 
to a personal computer (PC) 12. Software running on PC 12 functions to 
periodically upload the captured video image to an Internet web page (i.e., a web 
server) shown in Figure 1 by block 13. Internet service providers (ISPs) 
commonly provide their patrons with a certain allocation of web page space for 
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personal use. This allows the user to upload images onto their web page 
periodically; with the frequency of uploading being dependent on the particular 
type of connection offered by the ISP. 

Presently, there are two shareware products in existence that relate to 
web cameras: Ispy ™ and Webcam32™. The Ispy webcam software functions 
to grab video images, save them as JPEG files, and then send the saved images 
automatically to a user-specified home page via the connection provided by the 
users' ISP. Ispy runs under Windows™ 95, Windows 98 and Windows NT 4.0; it 
also works with any video for Windows-compatible cameras and frame grabbers. 
Webcam32 is a Windows 95, Windows 98 and Windows NT application that 
allows video camera images to be displayed within a web page. Like Ispy, 
Webcam32 software is able to upload images to a web server to allow images to 
be obtained directly from the page. Both of these products include various 
simple image-processing features such as captioning of photos, day/time 
stamping, and text additions. 

Webcam32™ software also offers rudimentary motion detection, which is 
of primary use in security surveillance applications. For example, the 
Webcam32™ software allows images to be uploaded when, say, 25% of the 
pixels in the image frame change from one image frame to the next. Although 
this motion-detecting feature of the software product is useful in limited types of 
motion detection applications (e.g., security surveillance), it is not useful for 
different applications. For example, if the web camera system is intended for use 
in observing and recording wildlife activity, then this type of rudimentary motion 
detection does not work well. 

Another problem with today's webcam systems is the conflict between the 
desire to minimize the number of times the web camera contacts the ISP and the 
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need to capture "interesting" pictures (i.e., those containing certain kinds of 
motion). Most security surveillance type of web camera systems have a low 
threshold that results in the taking of many pictures whenever activity is detected. 
Uploading many pictures onto a web page presents a serious bandwidth 
5 problem. 

Furthermore, existing products such as Ispy and Webcam32 only provide 
the ability to capture images on a given schedule, e.g., once per hour, or 
whenever motion occurs, regardless of how often. If set to capture on a 
predetermined schedule, images that may be of interest to the user may end up 
10 being ignored. On the other hand, if the software is set to upload a video image 
whenever motion is detected, scenes containing frequent motion can tie up the 
user's phone lines. 

Thus, there is a need for a web camera system that overcomes the 
problems inherent in the prior art. 



15 
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Summary of the Invention 



The present invention is a camera system for connection to a web server. 
The system comprises a video camera and a processor that periodically uploads 
images captured by the video camera in accordance with one of a plurality of 
5 motion detection algorithms. A first motion detection algorithm captures a current 
image frame when a pixel comparison between successive image frames 
exceeds a predetermined threshold. 
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Brief Description of the Drawings 

The present invention will be understood more fully from the detailed 
description which follows and from the accompanying drawings, which however, 
should not be taken to limit the invention to the specific embodiments shown, but 
5 are for explanation and understanding only. 



10 utilized in accordance with one embodiment of the present invention. 

Figure 4 is^a flowchart illustrating one of the motion-detecting algorithms 
utilized in accordance with one embodiment of the present invention. 




Figure jrfs a conceptual diagram of various motion-detecting algorithms 
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Detailed Description 

A web camera system that operates in accordance with one of a plurality 
of motion detection algorithms is described. In the following description, 
numerous specific details are set forth, such as specific operating modes, 
procedures, circuit elements, etc., in order to provide a thorough understanding 
of the invention. It will be obvious, however, to one skilled in the art, that these 
specific details may not be needed to practice the present invention. 

The state-of-the-art of web camera systems is such that there exists a 
conflict between the desire to minimize the number of times the web camera 
system dials up the Internet service provider and the desire to capture 
"interesting" pictures (e.g., those containing certain particular kinds of motion). 
As previously discussed, existing software products utilized to capture video 
images permit some rudimentary motion detection. These programs are utilized 
in applications concerned primarily with uploading images the instant motion is 
detected. While such programs are suitable for use in applications such as 
security surveillance systems, they suffer problems when used in different 
applications, e.g., observation of wildlife activity. 

Figure 2 illustrates three image frames 21-23 that may be captured 
utilizing a web camera system. In this case, the picture of interest is a bird 
feeding at a bird feeder station. The web camera is directed at the station, and 
operates to upload captured images either periodically, or in response to 
detected motion, or both. The problem that exists with prior art web camera 
systems that periodically upload images, say, every half-hour, is that a bird may 
arrive and leave many times within that time interval. In the event that a bird is 
not present at the scheduled image capture time, the only picture that will be 
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captured is one of an empty bird feeder. By way of example, this is the situation 
represented by image frame 21 . 

If a motion detecting feature is included in the web camera system, the 
taking of a picture may be triggered each time movement above a certain 
threshold is detected. In this case, when the number of pixels between two 
successive pictures changes (above a predetermined threshold) a new picture is 
captured and uploaded to the user's specified web site. By way of example, 
frame 22 of Figure 2 illustrates the arrival of a bird at the bird feeder, which 
triggers the video capture of an image frame. Note that many web camera 
systems in use today typically have low threshold settings that result in the taking 
of many pictures when the slightest activity is detected. This results in a 
bandwidth problem for the connection to the Internet service provider. 

Another concern relates to movement of the bird when it takes flight to 
leave the bird feeder station. Existing web camera systems with motion 
detecting features will trigger on this type of motion. Unfortunately, the last video 
image captured as a result of this type of motion is an empty bird feeder station, 
as represented by frame 23. In other words, the most recently captured image 
for uploading to the user's web site is that of the empty bird feeder, rather than 
the desired image of wildlife activity. 

The present invention^solve^ the problem of motion detection and timed 
update by uploading one image each predetermined interval selecting the best 
candidate video image that occurred during any given interval. The camera 
system includes a video camera coupled to a processor that operates in 
accordance with one of a plurality of motion detection algorithms to select an 
image for uploading to the user's web site. 
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The basic algorithm is as follows. Motion is first detected by performing a 
pixel comparison between successive image frames. When the pixel comparison 
between a current image frame and a previous image frame exceeds a 
predetermined threshold level, software running on the processor saves the 
5 current picture as a candidate for uploading. A simple frame grabber technology 
may be utilized for capture of the video image. 

The processor may be programmed to periodically upload the current or 
last candidate image. For example, a typical web camera system may operate 
by uploading the last candidate image once every hour. If no motion is detected 
10 in the past hour, there are two options: either no uploading of any image, or 
upload a current image captured by the camera regardless of whether motion is 
detected. 

Figure 3 illustrates a conceptual diagram of various detecting algorithms 
utilized in accordance with one embodiment of the present invention. Details of 
15 this embodiment are provided below and also in the attached Appendix, which 
contains a code listing of a software program that implements these algorithms. 

With reference to Figure 3, a camera outputs a color image, which is then 
converted to a black-and-white (B/\N) pixel representation of a current image 
frame. Three different modes of motion detection are then provided to capture 

20 specific types of events. In the basic mode of motion detection, the web camera 
system selects the most recent image containing motion above a certain 
threshold. This image represents the candidate image for uploading to the user- 
specified web server. As previously discussed, the basic mode of motion 
detection involves a straightforward pixel-by-pixel comparison between a current 

25 frame and a previously captured image frame. 
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A second type of motion detection mode of operation is referred to as 
"stable-change" detection. The stable-change mode of motion detection is 
designed to capture persistent changes in a scene, while ignoring relatively 
simple motions. For example, the stable-change mode of operation is useful in 
5 video conferencing applications where the video camera is aimed at a desk or 
whiteboard where a speaker is placing written subject matter or other objects in 
front of the camera to facilitate discussion. In such applications, it is useful to 
capture the notes written on a board, or otherwise presented in front of the 
camera, while ignoring extraneous motion such as writing on the board, finger 
10 pointing, etc. 

The stable-change mode of operation is aimed at detecting stable 
changes to a particular scene and operates in accordance with an algorithm that 
captures an image frame a certain time period following a last detected motion 
event. The time duration may be programmable, timed, or fixed depending on 

15 what is being viewed and what is to be captured. In the video conferencing 
application discussed above, software running on the processor operates to 
capture a video image frame and copy or upload it to remote sites whenever a 
new writing (or other object for presentation) is placed in front of the video 
camera. The stable-change mode of operation ignores constant ongoing activity 

20 in the field of view. 

As can be seen in Figure 3, the stable-change algorithm compares a 
current frame against a last stable frame and selects as a candidate picture the 
last stable frame when no motion has been detected for a certain duration (e.g., 
two seconds). 

25 The third mode of motion detection operation is referred to in this 

application as "novel" motion detection. The novel motion detection mode of 
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operation solves the problem that arises in certain applications such as 
observation of wildlife activity wherein the motion of a bird arriving at a feeder is 
very similar to the motion of the bird departing from the feeder. A webcam 
system operating in accordance with only a basic motion detection algorithm -- 
which simply saves the most recent image with motion -- cannot distinguish 
between these two types of events. In other words, the basic motion detection 
algorithm captures the image of the just vacated bird feeder for uploading to the 
web server, rather than the picture of the bird that left, simply because it triggers 
on the last motion detected. 

In solving this problem, the novel motion detection algorithm compares an 
image that contains motion against the most recent stable image, as described 
above. Images that are not substantially different from the stable image frame 
are ignored. 

Figure 3 illustrates that novel motion detection involves not only detection 
of motion, based on a pixel comparison of a current frame against a previous 
frame, but also a comparison between the current frame and a last stable frame. 
For example, as applied to the empty bird feeder problem, the last stable frame is 
that of the recently vacated bird feeder. According to this third algorithm, motion 
is recorded instead of the absence of motion. That is, the detection of motion, 
based on a pixel comparison between the current frame and previous frame, 
triggers capture of a current image frame in a buffer. The buffer may be any one 
of a variety of buffers, such as a circular buffer, with a capacity to store a 
sufficient number of image frames. 

In this mode, each time motion is detected, the current image is captured 
into a circular buffer. In other words, in the novel motion detection mode of 
operation, pictures are captured into a buffer at times other than the last stable 
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frame. The detection of a last stable frame triggers the uploading of an image 
from the circular buffer that was captured some predetermined time prior to the 
triggering event. In the bird feeder example, the uploaded image might be an 
image frame captured several seconds prior to the last detected motion, e.g., an 
5 image of the bird prior to leaving. 

In one embodiment of the present invention the selected image represents 
a candidate picture that may be uploaded to a web server at a predetermined 
interval. Generally speaking, the user may set the interval for uploading the last 
candidate picture, as well as the particular mode of motion detection to be 
utilized. 




/ A'9 ure 4 is a flow chart illustrating the novel motion detection algorithm in 
^Tccordarfce with one embodiment of the present invention. The flow chart begins 
at step 31 , Vhich indicates the capture of a current image frame by the video 
camera. If aVolored video camera is utilized, the color image may be 
15 transformed tc\a black and white pixel representation. At step 33, a pixel 

comparison is made between the current frame and a previously captured frame. 
If the pixel comparison indicates that the number of pixels between the two 
frames exceeds a predetermined threshold, then the algorithm proceeds to step 
34. If the predetermined threshold is not exceeded (i.e., no motion is detected) 
20 the flow chart returns t\ step 31 . 

When basic motion is detected, the pixel comparison causes a motion 
signal to be asserted by logic circuitry in the processor. This is illustrated in 
Figure 4 by step 34. Assertion of the motion signal causes the current image to 
be loaded into a candidate buffer which holds the most recent images for periodic 
25 uploading to a web site. 
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Both the current image frame and the previous image frame may be held 
in separate buffers after being captured by the video camera. In this particular 
mode of operation, a circular buffer may be utilized as a candidate buffer for 
holding the most recent images captured responsive to the motion signal. The 
5 storing of the image frames in the circular buffer is represented in Figure 4 by 
step 35. 

}Py yl?teP 36 is a determination of whether a stable image frame is detected. If 
J^rtot, theVilgorithm returns to the beginning step 31 . On the other hand, if a stable 
frame is detected, then one of the stored frames is selected for uploading to a 
0 web serveV The particular frame that may be chosen is the frame that occurred 
a predetenAned time prior to the detection of a stable frame. For example, in the 
bird feeder example, it is useful to select an image that was captured several 
seconds prior tp the last detected motion of the bird leaving the feeder. 
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